Bark and ERB bilinear transforms

Authors

  • Julius O. Smith
  • Jonathan S. Abel

Abstract

Use of a bilinear conformal map to achieve a frequency warping nearly identical to that of the Bark frequency scale is described. Because the map takes the unit circle to itself, its form is that of the transfer function of a first-order allpass filter. Since it is a first-order map, it preserves the model order of rational systems, making it a valuable frequency warping technique for use in audio filter design. A closed-form weighted-equation-error method is derived that computes the optimal mapping coefficient as a function of sampling rate, and the solution is shown to be generally indistinguishable from the optimal least-squares solution. The optimal Chebyshev mapping is also found to be essentially identical to the optimal least-squares solution. The expression 0.8517 [arctan(0.06583 fs)]^(1/2) − 0.1916 is shown to accurately approximate the optimal allpass coefficient as a function of sampling rate fs in kHz for sampling rates greater than 1 kHz. A filter design example is included that illustrates improvements due to carrying out the design over a Bark scale. Corresponding results are also given and compared for approximating the related “equivalent rectangular bandwidth (ERB) scale” of Moore and Glasberg using a first-order allpass transformation. Due to the higher frequency resolution called for by the ERB scale, particularly at low frequencies, the first-order conformal map is less able to follow the desired mapping, and the error is two to three times greater than in the Bark-scale case, depending on the sampling rate.
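
As a rough illustration of the approximation quoted in the abstract, the following Python sketch evaluates the allpass coefficient for a given sampling rate and the frequency warping that a first-order allpass map produces on the unit circle. The function names are hypothetical, and the sign convention of the map, z -> (z - rho) / (1 - rho z), is an assumption made here for illustration; the paper itself may define the map with the opposite sign.

```python
import math


def bark_allpass_coefficient(fs_khz):
    """Approximate Bark-warping allpass coefficient for a sampling rate in kHz.

    Uses the expression from the abstract, stated there to be accurate
    for sampling rates greater than 1 kHz.
    """
    return 0.8517 * math.sqrt(math.atan(0.06583 * fs_khz)) - 0.1916


def allpass_warp(omega, rho):
    """Warped frequency (radians/sample) for input frequency omega.

    Assumed convention: the phase of (z - rho) / (1 - rho z) evaluated
    at z = exp(j * omega), with |rho| < 1.
    """
    return omega + 2.0 * math.atan2(rho * math.sin(omega),
                                    1.0 - rho * math.cos(omega))


if __name__ == "__main__":
    rho = bark_allpass_coefficient(44.1)  # CD-quality sampling rate
    print(f"rho at 44.1 kHz: {rho:.4f}")
    # Show how a few frequencies in [0, pi] are redistributed by the map.
    for k in range(5):
        omega = k * math.pi / 4
        print(f"omega = {omega:.4f} -> warped = {allpass_warp(omega, rho):.4f}")
```

With a positive coefficient under this convention, low frequencies are stretched and high frequencies compressed, which is the qualitative behaviour a Bark-like warping calls for. The endpoints 0 and pi map to themselves, consistent with the map taking the unit circle to itself.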

Similar resources

Auditory Scale Analysis and Evaluation of Phonemes in MISING Language

Frequency analyzer is one of the important functions of peripheral auditory system. Psycho-acoustically this gives rise to the concept of critical band, which represents the frequency resolution of the auditory system. Mel-Scale warping is one of the common techniques used for the analysis in speech recognition. Bark and ERB (Equivalent Rectangular Bandwidth) rate scales are two other auditory ...

A New Method of Objective Speech Quality Assessment in Communication System

On the Quality of Experience (QoE) evaluation of communication system, the quality of speech is an important factor to evaluate the system. Perceptual evaluation of speech quality (PESQ) is a well known objective speech quality assessment method for the voice QoE evaluation. It is proposed by International Telecommunication Union (ITU) and is formed as the ITU-T P.862 Recommendations. PESQ appl...

Conditional gauge theorem for non-local Feynman-Kac transforms

Feynman-Kac transforms driven by discontinuous additive functionals are studied in this paper for a large class of Markov processes. General gauge and conditional gauge theorems are established for such transforms. Furthermore, the L2-infinitesimal generator of the Schrödinger semigroup given by a non-local Feynman-Kac transform is determined in terms of its associated bilinear form.

Auditory-Based Features Extraction Method for Speech Recognition

In this paper we present a feature extractor for speech recognition. The proposed feature extraction method is based on auditory filter modelling. The latter uses a Gammachirp Filterbank (GcFB), whose center frequencies are selected according to one of three scales: the ERB-rate scale, the MEL scale or the BARK scale. The performance of the proposed features is evaluated, in the contex...

The Disc as a Bilinear Multiplier

A classical theorem of C. Fefferman [3] says that the characteristic function of the unit disc is not a Fourier multiplier on L^p(R^2) unless p = 2. In this article we obtain a result that brings a contrast with the previous theorem. We show that the characteristic function of the unit disc in R^2 is the Fourier multiplier of a bounded bilinear operator from L^{p1}(R) × L^{p2}(R) into L^p(R), when 2 ≤ p1, p2 <...

Journal:
  • IEEE Trans. Speech and Audio Processing

Volume 7  Issue

Pages  -

Publication date 1999